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Abstract 

A complex web of roads, walkways and public transport systems can hide areas of geographical isolation 
very difficult to analyze. Random walks are used to spot the structural details of urban fabric. 

PACS: 89.65.Lm, 89.75.Fb, 05.40.Fb, 02. 10. Ox 

Keywords: Space syntax, Queuing networks, Random walks 



Contents 6 Petersen graph and Venetian Canals ll2| 



Introduction Ufl 7 Discussion and Conclusion 15 



City space syntax as a queueing network 

with a steady-state behavior Q List of Figures 

2.1 Steady states in queuing networks . . 

2.2 Graph automorphisms and steady states S 1 The cumulative distributions of far-away 

neighbors in the dual graph of Bielefeld 
Small worlds of big cities downtown. The cumulative degree distri- 
bution are shown in the first row. The cu- 
From random walks to Euclidean spaces mutative distributions of far-away neigh- 
bors are encoded in the second and forth- 
Euclidean embedding coming rows. Probability is ranked from 
of graphs by random walks [l^ zero (dark blue) to 1 (red) 



1 



10 



The cumulative distributions of far-away 
neighbors in the dual graphs of Rothen- 
burg ob der Tauber, Bavaria. The cumu- 
lative degree distribution are shown in the 
first row. The cumulative distributions of 
far-away neighbors are encoded in the sec- 
ond and forthcoming rows. Probability is 
ranked from zero (dark blue) to 1 (red). . 

The cumulative distributions of far-away 
neighbors in the dual graph of Venetian 
canal network (96 canals). Probability is 
ranked from zero (dark blue) to 1 (red). . 

The cumulative distributions of far-away 
neighbors in the dual graph of the street 
grid in Manhattan. Probability is ranked 
from zero (dark blue) to 1 (red) 



The Petersen graph. 



The dual graph representation of the spa- 
tial network of 96 Venetian canals . . . . 

The Euclidean space embedding of the 
Petersen graph drawn with respect to the 
node #1 

The spectrum of the Markov transition 
operator (|19[) defined on the spatial net- 
work of Venetian canals 



The scatter plot (in the log-log scale) of 
the connectivity vs. the value of access 
time to a node in the dual graph repre- 
sentation of 96 Venetian canals. Three 
data points characterized by the short- 
est access times represent the main water 
routes of Venice: the Lagoon of Venice, 
the Giudecca canal, and the Grand canal. 
Four data points of the worst accessibility 
are for the canal subnetwork of Venetian 
Ghetto. The slope of the regression line 
equals 2.07 

The scatter plot of the connectivity vs. 
the norm a node in the dual graph rep- 
resentation of 50 streets in the downtown 
of Bielefeld. The slope of the regression 
line equals 2.17 



11 The scatter plot (in the log- log scale) of 
the mean shortest distance vs. the value 
of access time for the network of Vene- 
tian canals. The plot indicates a positive 
relation with the slope of regression line 
equals 0.18 

12 The 2-dimensional projection of the 95- 
dimensional Euclidean spaces associated 
to random walks defined on the city canal 
network built from the perspective of the 
Grand canal of Venice chosen as the ori- 
gin. The labels of the horizontal axes dis- 
play the expected number of random walk 
steps. The labels of the vertical axes show 
the degree of nodes (radiuses of the disks). 



a 

B 

13 



13 



13 



14 



14 



2 



1 Introduction 

In the present paper, we discuss the application of 
Markov processes to urban studies. However, it is 
important to emphasize that the approach related to 
Markov processes (such as random walks) can be used 
to analyze any type of complex networks. The main 
technical idea beyond the method is to investigate 
the properties of networks by exploiting the spectral 
properties of self-adjoint operators (symmetric matri- 
ces) defined on their graph representations. It is then 
obvious that a network can be effectively investigated 
by spectral methods, if it is not too large. The spatial 
networks of human settlements give good examples - 
for instance, the city of Paris (inside the Periphery 
Boulevard) contains only 5131 interconnected open 
spaces. 

A city converts a space pattern into a pattern of 
relationships. While discussing the impact a spa- 
tial structure has on the social establishment, it is 
inevitable to mention two Englishmen. After the 
British House of Commons had been destroyed in 
1941, a long debate commenced in the Parliament 
on whether the old House insufficient to seat for all 
of its members be rebuilt or a new modern House 
should be constructed. In his speech to the meet- 
ing in the House of Lords, October 28, 1943, Sir 
Winston Churchill exclaimed, "We shape our build- 
ings, and afterwards our buildings shape us" pQ, as 
if he believed that the movement pattern emerged 
in the old House of Common is responsible for the 
establishing and sustain of the British political tradi- 
tion. This intuitive knowledge had been conceived 
by another British gentleman, Bill Hillicr, profes- 
sor at University College of London, within the con- 
cept of space syntax, a theory developed in the late 
1970s, that seeks to reveal the mutual effects of com- 
plex spatial urban networks on society and vice versa 
[2j [3] . The joint use of scarce space by many peo- 
ple creates life in cities which is driven largely by 
micro-economic factors which are invariant over var- 
ious cultures that tends to give cities similar struc- 
tures. Hereby, the street configuration itself natu- 
rally creates differential patterns of occupancy (as 
an emergent phenomenon), whereby some streets be- 
come, over time, more highly used than others [4]. At 



the same time, a background residential space process 
driven by cultural factors tend to make cities differ- 
ent from each other, so that the emergent urban grid 
pattern containing an imprint of certain geometrical 
regularities forms a network of interconnected open 
spaces, being a historical record of a city creating pro- 
cess driven by human activity and containing traces 
of society and history [2]. 

It is important to note that from the very begin- 
ning, graph theory had been developed in a close re- 
lation to urban studies: Leonard Euler's founding pa- 
per on the Seven Bridges of Konigsberg published in 
1736 is regarded as the first paper in the history of 
graph theory, [5]. In the present paper, we review 
the spectral algorithms designed to analyze trans- 
port networks. In particular, the method proves its 
efficiency for revealing the disadvantageous areas de- 
signed to foster crime, deprivation, and ghettoisation, 

In the forthcoming sections, we explain that city 
space syntax can be considered as a queuing network 
operating in the free flow regime (see Sec. The 
steady state of a queuing network (see Sec. 12. 1[) is 
determined by the set of automorphisms of the graph 
spanning the network (Sec. 12. 2| ). We shall show that 
these automorphisms naturally correspond to ran- 
dom walks of the nearest neighboring type, in which 
a traveller, after passing through an open space, ran- 
domly chooses another space to move in among those 
intersecting with the given one. How many possibil- 
ities to move the walker has while in an arbitrary 
place? - Indeed, it depends upon the type of the city 
he dwells in. We have analyzed two German medieval 
organic cities, the city canal network in Venice, and 
the regular street grid in Manhattan and found that 
while the organic towns are relatively compact (the 
majority of streets in them are just three steps apart 
from each other), the bigger urban patterns demon- 
strate the structure of small worlds, in which a dense 
network of relatively short local streets is supported 
by the long axial itineraries connecting the far-away 
districts within the city that essentially simplifies the 
navigation and transportation tasks in the urban en- 
vironment (see Sec. [3]). Being a representation of 
the set of graph automorphisms, random walks pro- 
vide us with an effective tool for describing the graph 
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structure in details (see Sec.lHE]). 111 Sec. El we illus- 
trate the method by studying two graphs by random 
walks. The first graph is the regular Petersen graph 
which contains just 10 nodes, and we investigate the 
network of Venetian 96 canals as the second example. 

The term "random walk" was originally proposed 
by K. Pearson in 1905 in his letter to Nature devoted 
to a simple model describing a mosquito infestation 
in a forest: at each time step, a single mosquito moves 
a fixed length, at a randomly chosen angle (see [7]). 
Nowadays it is well known that random walks could 
be effectively used in order to investigate and charac- 
terize how effectively the nodes and edges of large net- 
works can be covered by different strategies, [51 151 ITU], 
Random walks help us to explore the structural prop- 
erties of complex urban networks which cannot be 
detected by the usual methods designed primarily to 
unravel the structures of large quasi-random graphs. 
We conclude in the last section (Sec[7]). 

2 City space syntax as a queue- 
ing network with a steady- 
state behavior 

Any graph representation naturally arises as the 
outcome of a categorization, when we abstract a real 
world system by eliminating all but one of its features 
and by grouping together items (or places) sharing 
a common attribute. For instance, the common at- 
tribute of all open spaces in city space syntax is that 
we can move through them. All elements called nodes 
that fall into one and the same set V are considered 
as essentially identical; permutations of them within 
V are of no consequence. The symmetric group §at 
consisting of all permutations of N elements (N be- 
ing the cardinality of the set V) constitute therefore 
the symmetry group of V. If we denote by E C VxV 
the set of ordered pairs of nodes called edges, then a 
graph is a map G(V, E) : E — > K C M + . We sup- 
pose that the graph is undirected and has no multiple 
edges. If two nodes are adjacent, € E, we write 
i ~ j. The degree of the node i € V is the number of 
its neighbors in G, deg G (i) = fcj. 

The set of interconnected urban open spaces in 



which a pedestrian or a vehicle, upon departures 
from one space, can join another constitutes a queue- 
ing network that can be regarded as a graph whose 
nodes represent the open spaces (streets, squares, 
and round-abouts), and whose edges represent links 
between nodes (junctions and crossroads). In the 
queueing theory [11], the paths along which a trav- 
eller may move from service station to service sta- 
tion are determined by the routing probabilities, and 
then the theory of Markov chains provides the sta- 
tistical models for the analysis of queueing networks. 
Travellers arriving to an open space are either moving 
through it immediately or queuing until the space be- 
comes available. Once the space is passed through, 
the traveller is routed to its next station, which is 
chosen according to a probability distribution among 
all other open spaces linked to the given one in the 
city. However, if the destination space has finite ca- 
pacity then it may be full and then the traveller will 
be blocked at its current location until the next space 
becomes available. 

2.1 Steady states in queuing networks 

If the network of N open spaces is not at all con- 
gested (as long as inflows are compatible with system 
capacities), there is a network flow - say, cars are trav- 
elling through it at some constant speed v > in the 
free flow regime. Given (X t :t>0), a i.i.d. random 
process characterized by some distribution / defined 
on V and let pt > be the arrival probability at the 
node i € V, in a sequence of n independent Bernoulli 
trials, then travellers visit the node i in average npi 
times. If trials are conducted in a finite time interval 
t, during which events occur in sequence, we denote 
the proportion of time spent by a traveller in i by 
Ai > 0, 

npi = Ajt, (1) 

The above formula expresses the linear Ohm's law 
relating the time variable with the number of inde- 
pendent Bernoulli trials, t = (j>i/\i)n. 

Assuming the Kirchoff 's current law, i.e. the total 
flow into any node equals the flow out, then the pro- 
portions Ai can be obtained as the solutions of the 
global balance equations (transport equations) satis- 
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fying a normalization condition, 

= 1. (2) 

i<£V 

In the queuing network load in the stationary regime 
with free flow, the steady states (or densities) A; > 

can be interpreted as the length distances of the 
correspondent streets i £ V normalized by ^ . 

In relation to a given distribution of Xi, there are 
several "performance indicators" characterizing the 
transport properties of the queuing network with re- 
spect to the random process (Xt : t > 0). Given the 
arrival probability pi — tXi/n, no matter from which 
node Xt=o = j the process starts, the first arrival 
probability (FAP) that a traveller arrives at the node 

1 EV by time t' > t is given by P (t' > 1 1 Xt' = i) = 
(1 — t ■ Xi/n) n . First encounter properties estimated 
by random walks play a crucial role in explorations 
of various real- world networks, including epidemic 
spreading, transport in disordered media, and neu- 
ron firing dynamics (see [12] and references therein). 
In the limit of an infinite number of trials n — > oo, 
the FAP is asymptotically exponential, 

lim P(t' > 1 1 X v = i) = e~ x *\ (3) 

n — >oo 

provided (X t : t > 0) preserves the proportion of so- 
journ time Xi given by ([T]). An exponential distri- 
bution arises naturally when modelling the time be- 
tween independent events that happen at a constant 
average rate. The typical recurrence time which char- 
acterizes the random duration elapsed between two 
consequent arrivals of the traveller at i £ V in the 
free flow regime described by the process (X t : £ > 0) 
is accounted by the probability density function (pdf) 
Fi(t) = Xie~ Xi *. Then the expected first-passage time 
(FPT) to the node i £ V from a node randomly cho- 
sen among all nodes in the networks is given by 

1 f°° 

Y = / tF ^) dt W 
M Jo 

In complex networks, the value of FPT crucially de- 
pends on the confining environment and is not de- 
termined by the simple relation ([?]). The FPT has 
been recently recognized as a key quantity to speci- 
fying the transport-limited kinetics in many models 
of transport in complex media, 12|. 



Other characteristic time can be used in order to 
quantify the long-run properties of the entire net- 
work. Let 1 = mm (t : Ut=o{^} = V^j be the 
covering time (also: the coupon-collector's time) re- 
quired the traveller visits all nodes of the closed queu- 
ing network (say, rides along all streets in the city). 
The value of 1 crucially depends upon the local trans- 
port properties of the network, [13] . It may be that 
T = oo if the graph spanning the network is directed, 
and the process (X t : t > 0) defined on that is irre- 
versible. The typical value of T for a finite queuing 
network is given by the unique solution r of the equa- 
tion 

E e ~ A ' r = 1 < ( 5 ) 

expressing the idea that after the tour the traveller 
inevitably returns into one of the previously visited 
nodes |13j . It is easy to demonstrate that the min- 
imal typical value, T m ; n = AHog-ZV, is achieved for 
the uniform distribution A^ = A^ 1 . It is also clear 
that t — ► oo if the network contains least frequently 
visited nodes for which Xi — > 0. 

However, in many situations, the arrival probabil- 
ities pi are not independent. It is intuitively clear 
that in real transport networks, the arrival rates pi 
at the different nodes are correlated in accordance to 
the chance to be joined by a path in the graph and 
therefore may be sensitive to the particular graph 
symmetries. In the forthcoming subsection, we de- 
rive the distribution A,; associated to the set of auto- 
morphisms of a simple graph G. 

2.2 Graph automorphisms and steady 
states 

Among all measures which can be defined on the 
set of nodes V, the set of normalized measures (den- 
sities) Xi > 0, i G V, satisfying ([2]), are of essential 
interest since they express the conservation of a quan- 
tity and therefore may be associated to some physical 
process. 

The fundamental physical process defined on the 
graph G(V, E) is generated by the subset of its auto- 
morphisms preserving the degrees of all nodes, ki. 
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For each graph G(V,E), there exists a unique, up 
to permutations of rows and columns, adjacency ma- 
trix A identified with a linear endomorphism of the 
vector space of all functions from V into R. In the 
special case of a finite simple graph (an undirected 
graph that has no self-loops), the adjacency matrix 
is a (0, l)-matrix such that A^ = li£i^j,ir^j J 
and = otherwise. The degree of a node i E V 
is therefore given by 



5> 



(6) 



The set of graph automorphisms, the mappings of 
the graph to itself preserving all of its structure, is 
specified by the symmetric group Sat including all 
admissible permutations II G Sjv taking the node i G 
V to n(i) G V. The representation of §/v consists of 
all N x N matrices Iln, such that (Iln)j n (i) = •"■> 
and (Hn)ij = if j ^ n(i). A linear transformation 
of the adjacency matrix 

N 

Z(A) (j = ^jsiA sl , T l]sl G K (7) 

s,i=l 



is a graph automorphism if 

n5z(A) n n = z(n5An n ) 



(8) 



for any II G Sat. The latter relation is satisfied if 
the entries of the tensor T in ([7]) meets the following 
symmetry property: 



(i)n(j)n( a )n(/) 



ijsl i 



(9) 



for any II G §at. The action of permu- 
tations preserves the conjugate classes of in- 
dex partition structures, so that any appropri- 
ate tensor T satisfying ([9]) can be expressed 
as a linear combination of the following tensors: 
{1, Sij,8i s ,6ii,8j s ,Sji,S s i,Sij8j s ,Sj s 6 s i,5 s iSii, SuSij, 
8ij8 s i, SigSji, SuSjs, SijSuSis} . 

Given a simple undirected graph G, then by substi- 
tuting the above tensors into ([7]) and taking account 
on symmetries we conclude that any arbitrary linear 
permutation invariant function Z (A) must be of the 
following form 



Z(A) 



ai + 5 lj (a 2 + a 3 kj) + 0,4 



(10) 



with kj = deg G (j) and 01,2,3,4 being arbitrary con- 
stants. 

If we require that the linear function Z preserves 
the notion of connectivity, 



ki — 



jev 



(11) 



it is clear that we should take a\ = a 2 = (indeed, 
the contributions aiN and 02 are incompatible with 
(JXTJ) ) and then obtain a relation for the remaining 
constants, 1 — 0,3 = CI4. Introducing the new param- 
eter P = 0,4 > 0, we reformulate (TIT))) as follows, 

Z{A) ij = {l-p)8 ij k j + PA ii . (12) 

If we express in the form of a probability con- 
servation relation, then the function Z (A) acquires 
a probabilistic interpretation. Substituting (|12J) back 
into HJ), we obtain 



1 = K'Ejev^iA) 



E 7 w (!-/?)<%+/? 
E 



T 



(0) 



(13) 



The operator is nothing else as the generalized 
random walk transition operator if < (3 < k^ x 
where /c max is the maximal degree in the graph G. In 
the "lazy" random walks defined by a random 

walker stays in the initial vertex with probability 1 — 
/3, while it moves to another node randomly chosen 
among its nearest neighbors with probability If 
we take (3 — 1, then the operator T^p describes the 
usual random walks extensively investigated in the 
classical surveys |14|-|16|. 

Being defined on a connected undirected graph, the 
matrix is a real positive stochastic matrix, and 
therefore, in accordance to the Perron-Frobenius the- 
orem [17] , its maximal eigenvalue equals 1, and it is 
simple. A left eigenvector, 

7rT (/3) = 7T, (14) 

associated with the maximal eigenvalue 1 has positive 
entries satisfying the normalization condition ^ in- 
dependently of the value /3, 

« = y^T- (15) 
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It is interpreted as the unique equilibrium state ir 
(the stationary distribution of random walks). If the 
graph G is not bipartite, any density a (<7j > 0, 
^2i£V °» = -0 asymptotically tends to the station- 
ary distribution under the action of the transition 
operator T-?\ 

7T = lim a (t^) * . (16) 

If G is regular, then tt is uniform. For a non-regular 
graph G, this property is replaced by time reversibil- 
ity (the balance equation) 

^T^ = ^\ i,jeV. (17) 

The stationary distribution of random walks tt is the 
unique density associated to the automorphisms of a 
simple undirected graph G. 

3 Small worlds of big cities 

The main focus of the space syntax study is on the 
relative proximity (or accessibility) between different 
locations and associating these distances with den- 
sities and intensities of human activity which occur 
at different open spaces and along the links which 
connect them [18l [19l [20] . The decomposition of the 
city space into a complete set of intersecting open 
spaces characterized by the certain traffic capacities 
produces a spatial networks which we call the dual 
graph representation of a city 

While identifying the spaces of motion which play 
the role of nodes in the dual graphs of the com- 
pact city patterns bounded by natural geographi- 
cal limitations, in the present paper we implement 
the street named oriented identification principle, the 
same which we used in our previous study [21) . The 
street named approach has been previously used by 
different authors, [23]- [25]. We have generalized their 
approach in a way to account the possible disconti- 
nuities of streets. Namely, we assign an individual 
street identification code (ID) to each continuous part 
of a street even if all of them share the same street 
name. Then the dual graph of the urban pattern is 
constructed by mapping spaces of motion coded with 



the same ID into nodes of the dual graph and in- 
tersections among each pair of individual spaces into 
edges connecting the corresponding nodes of the dual 
graph. 

The graph theoretic distance (also: depth) dy be- 
tween two locations in the urban pattern is the least 
number of steps (or the elementary navigation ac- 
tions) required to reach the node i from the node j 
in the dual graph representation of the city. In par- 
ticular, if two continuous streets intersect, then the 
distance between them equals one. Given the dual 
graph representation of a city, we can compute the 
distances between all pairs of its nodes. 

The number of neighbors a node has at a distance 
d in the dual graph quantifies the relative structural 
importance of the location in the urban environment, 
3 . The statistics of far away neighbors can be pre- 
sented in the form of cumulative distribution func- 
tions tyd(n) quantifying the probabilities that a node 
has at least n neighbors at a distance d. Being 
monotonous functions of n, cumulative distributions 
reduce the noise in the distribution tails, however the 
adjacent points on their plots are not statistically in- 
dependent. 



# far-away neighbors 




Bielefeld (downtown) 

Figure 1: The cumulative distributions of far-away 
neighbors in the dual graph of Bielefeld downtown. The 
cumulative degree distribution are shown in the first row. 
The cumulative distributions of far-away neighbors are 
encoded in the second and forthcoming rows. Probability 
is ranked from zero (dark blue) to 1 (red). 

For regular graphs, the number of far-away ncigh- 
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bors grows up exponentially fast with the distance. 
In Figs. [U [2 we have displayed the cumulative dis- 
tribution functions Pd(n) quantifying the probabili- 
ties of that an average street is interconnected with 
at least n other streets within a distance d calculated 
for the dual graphs of two German organic cities - the 
downtown of Bielefeld in Westphalia and Rothenburg 
ob der Tauber in Bavaria. It is obvious that most of 
the streets in these towns can be reached in three 
navigation steps. The compactness of German me- 
dieval towns (see Figs. [I] [2]) uncovers the historical 
functional background beyond their structure. 



# far-away neighbors 
10 29 30 40 50 




Rot^eiiDurg oD'aer^iaiiDer 



Figure 2: The cumulative distributions of far-away 
neighbors in the dual graphs of Rothenburg ob der 
Tauber, Bavaria. The cumulative degree distribution are 
shown in the first row. The cumulative distributions of 
far-away neighbors are encoded in the second and forth- 
coming rows. Probability is ranked from zero (dark blue) 
to 1 (red). 

The term of a burgh has been in use since the 12 th 
century when David I of Scotland created the first 
Royal burghs. Early burghs were granted the power 
to trade, which allowed them to control trade until 
the 19 th century, [27]. Their spatial structure was 
shaped by the public activities such as trade and ex- 
change - ordered in such a way as to maximize the 
presence of people in central areas. In such an organic 
city, the majority of streets are just by a few syntac- 
tic steps away from each other, so that the entire city 
is compact that can be clearly seen from statistics of 
far away neighbors shown in Figs. [TJU 



It is interesting to mention that in the larger ur- 
ban patterns such as the street grid in Manhattan 
and the canal network of Venice, most of the nodes 
of spatial networks can still be reached in less as three 
navigation steps (see. Figs. I3|4| ). However, there are 
also noticeable fractions of relatively isolated places. 
The structure of cumulative distributions shown in 

far-away neighbors 




Venice (canals) 



Figure 3: The cumulative distributions of far-away 
neighbors in the dual graph of Venetian canal network 
(96 canals). Probability is ranked from zero (dark blue) 
to 1 (red). 

Figs. [3J Q] suggests that the correspondent graphs 
contain many cliques (complete subgraphs), together 
with subgraphs being "almost" cliques. In complex 
network theory, this phenomenon is called the small 
world property. Small-world networks are character- 
ized by a small diameter and a high clustering co- 
efficient having connections between almost any two 
nodes within them. Hubs - nodes in the network serv- 
ing as the common connections mediating the short 
path lengths between other edges are commonly as- 
sociated with small- world networks, [28]. 

The tendency to shorten distances in urban space 
networks induced by the public processes is comple- 
mented in the "small world" cities (like those repre- 
sented in Fig. [3J 0]) with the residential process which 
shapes relations between inhabitants and strangers 
preserving the original residential culture against un- 
sanctioned invasion of privacy. While the majority 
of streets and canals characterized with an excellent 
accessibility promotes commercial activities and in- 
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ffi far-away neighbors 

20 40 60 80 




Manhattan 



Figure 4: The cumulative distributions of far-away 
neighbors in the dual graph of the street grid in Man- 
hattan. Probability is ranked from zero (dark blue) to 1 
(red). 



tcnsifies cultural exchanges, certain districts of such 
cities demonstrate the opposite tendency having res- 
idential areas relatively segregated from the rest of 
the city, [3]. 

The main technical problem of space syntax inves- 
tigation is to rank out the open spaces in the city in 
accordance to their accessibility. 



4 From random walks to Eu- 
clidean spaces 

The idea of using the spectral properties of self- 
adjoint operators in order to extract information 
about graphs is standard in spectral graph theory 
[29] and in theory of random walks on graphs [14]- 
[16] , In the following calculations, in the transition 
operator (|13p we take (3 = 1 that allows to compare 
directly the forthcoming formulas with those known 
from the classical surveys on random walks defined 
on graphs, [H]-[T6]. 

Among all measures which can be defined on the 
set of nodes V, there is one associated to the station- 
ary distribution of random walks, m = ^2 ieV ^iSi, 
in which Si is the vector of the canonical basis that 
equals 1 at i € V and zero otherwise, with respect 
to which the transition operator ( (fl3|> with (3 = 1) is 



sclf-adjoint in the Hubert space H(V), 



(18) 



In the above equation, T T is the adjoint opera- 
tor, and 7T is considered as the diagonal matrix 
diag (7ri, . . . , 7r/v). In particular, the elements of 
the symmetric transition operator defined on simple 
undirected graphs equal 



T — 



y ki kj 



iff 



(19) 



and zero otherwise. 

The orthonormal ordered set of real eigenvectors 
ipi, i = 1 ... N, of the symmetric transition operator 
T forms a basis in Hilbert space Ti(V). The com- 
ponents of the first eigenvector tpi belonging to the 
largest eigenvalue ju,i = 1, 



(20) 



describes the connectivity of nodes (iii cx hi). The 
Euclidean norm of the orthogonal complement of 



ipi, Yl s =2' t l J si = 1 — Tfi) quantifies the probabil- 
ity that a random walker is not in i. The eigen- 
vectors, {^1^21 belonging to the eigenvalues 1 > 
/i 2 > ■ ■ • /i 7v > ~1 describe the connectedness of the 
graph G. The exterior products of vectors calculated 
by means of determinants are used in Euclidean ge- 
ometry to study areas, volumes, and their higher- 
dimensional analogs, [30]. Denote by C m the set of 
all subsets of V containing precisely 1 < m < N 
nodes. It is obvious that the size \C m \ is given by the 
binomial coefficient, 



m\(N - to)! 



Let us note that the eigenvectors ipi, i = 1...N, 
induce basis systems for the spaces of all square 
summable TO-point functions / : {ij, . ..i m S V} — > 
M. We consider the matrix of eigenvectors of the 
symmetric transition operator (|20p . 



* = 



\ IpN.l 




(21) 
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in which the rows {^fc}fc=i are orthonormal, 
(i/ji, ipj) = Sij. It is well known that det = ±1 and 
numerically equal to the content of the parallelotope 
spanned by the unit eigenvectors {ipk}u=i- 

The nice property of minors of order m 

obtained by the deletion of to rows {s;}, I — 1, . . . m, 
and columns {i;}, I = l,...m, from det \& is that 
they are equal to their algebraic complements in 
multiplied by ±1, [31] . Moreover, the complete set 
of minors of order m forms an orthonormal system of 
to— point antisymmetric functions, 



E 

H , ■ ■ -in 



Si , . . .s, 



■'•si si 



(22) 



and can therefore be chosen as the basis in the Eu- 
clidean space Rl^™!, [3T]. In the next section, we 
use the minors of order one to construct the embed- 
ding of undirected graphs into Euclidean space. In 
particular, we demonstrate that since any density a 
taken as an initial distribution of random walkers on 
an undirected graph converges to the unique station- 
ary distribution 7r as t — > oo, this Euclidean space is 

The orthonormal systems of functions (f2"2"|) are use- 
ful for decomposing the data when analyzing the 
traffic properties of transport networks. The data 
(say, of the real-time traffic flows and speed measure- 
ments) from to different detectors taken at different 
time slices can be uniquely decomposed using the or- 
thonormal system (|22[) into smaller, more manage- 
able parts and then compared allowing therefore an 
effective classification of traffic speed patterns. 



5 Euclidean embedding 

of graphs by random walks 

In the present section, we consider the orthogonal 
system of minors of order one, \f r *, which are numer- 
ically equal to the correspondent components of the 
eigenvectors ips,i- I n particular, it is clear that the 
minors (^\) . = 7r$ in accordance to ([20)) . The mi- 
nors of order one (m = 1) are associated to the nodes 
of the graph and provide the basis for the Hilbcrt 



space Tt(V). We demonstrate that a simple undi- 
rected graph can be embedded into the (TV — 1) Eu- 
clidean space, so that for any pair of nodes the dis- 
tance and angle can be uniquely defined. 

It is important to note that Markov's symmetric 
transition operator T defines a projection of any den- 
sity a S TC(V) on the eigenvector tpi related to the 
stationary distribution 7r, 



oT = Ipi + <7j_ T, <7_|_ 



(23) 



in which a± is the vector belonging to the orthog- 
onal complement of tp 1 . It follows from (| 16|) that 
lim^oo o \T l = 0, so that the vector a± character- 
izes the transient process induced by a. 

In space syntax, we are interested in a comparison 
between the different densities defined on the nodes 
of the graph. It is convenient to compare them in ac- 
cordance to their proximity to the unique stationary 
distribution of random walks n defined on the graph 
G. Since all components tpu > 0, we can rescale the 
density a by dividing its components by the compo- 
nents of tpi , 

* = -P- = (24) 

It is clear that any two rescaled densities a,pEH 
differ with respect to random walks only by their dy- 
namical components orthogonal to tpi , 

(a - p) T* = (a ± - p ± ) f\ 

for all t > 0. 

Therefore, we can define the distance || . . . \ \t based 
on random walks between any two densities by 



P 



E 

t > 



cr± - Pa 



cr± - pi 



(25) 



or, using the spectral representation of T, 



T-P\\ 

E 



N 



JV 



\i>s){i> s 



(cr±-p ± | ip a )( xjj s <Tj_ — pj_ ) 



(26) 

where we have used Dirac's bra-ket notations espe- 
cially convenient for working with inner products and 
rank-one operators in Hilbert space. 
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If we introduce a new inner product for densities 
a,p£ H(V) by 

{p,P)t = 2^ T^TT, ' ( 27 > 

then (|26|) is nothing else but 

Ik-Pllr = lkllT + l|p|lr-2(ff,p) T » (28) 
where 

lkllr= £ ^4^^ (29) 

is the square of the norm of a E H(V) with respect 
to random walks defined on the graph G. 

We conclude the description of the (N — 1)- 
dimensional Euclidean space structure of G induced 
by random walks by mentioning that given two den- 
sities a,/) e the angle between them can be 
introduced in the standard way, 

co S Z(p,a)= { °> P 1 T . (30) 
Ikllr IMIt 

Random walks embed connected undirected graphs 
into the Euclidean space R^ -1 . This embedding can 
be used in order to compare the accessibility proper- 
ties of individual nodes and to construct the optimal 
coarse-graining representations of the transport net- 
works. 

Indeed, the vector Si of the canonical basis is the 
particular case of density, which is zero for all nodes 
excepting for the node i E V where it equals 1. In 
accordance to (|29l) . the density Si acquires the norm 
|j Si \\ T associated to random walks defined on G. In 
the theory of random walks p3j, its square, 

1 N 

ikii^Ei^ (3D 

gets a clear probabilistic interpretation expressing the 
spectral formula of access time to the node i E V, 
the expected number of steps required for a random 
walker to reach the node i starting from an arbitrary 



node chosen randomly among all other nodes of the 
graph G with respect to the stationary distribution of 
random walks tt. The squared norm of the canonical 
vector || Si ||^ can be used in order to characterize the 
access to the node i from other nodes of the graph G. 
By the inequality between arithmetic and harmonic 
means, it is easy to prove [14] that 

\\5 1 \\ 2 t> {1 ^^, (32) 

so that the nodes which are difficult to reach <C 1) 
are also isolated from the rest of the graph. 

The Euclidean distance between any two nodes of 
the graph G is calculated as the distance ([2"B)l induced 
by random walks between two canonical vectors Si 
and Sj, 

(33) 

gives the spectral representation of commute time de- 
fined in the theory of random walks on undirected 
graphs as the expected number of steps required for 
a random walker starting at i E V to visit j E V 
and then to return back to i, [T4] . 

The commute time which plays the role of the Eu- 
clidean distance between the nodes of the graph can 
be represented as a sum, K^j — Hij + Hj t i, in which 

Hi* = lk<||T-(*<.*j)r ( 34 ) 

is the first hitting time which plays probably the most 
important role in the quantitative theory of random 
walks [ 14"]. It quantifies the expected number of steps 
before node j is visited if starting from node i. The 
first hitting time Hij satisfies the equation 

with the intimal condition Hi^ = 0. The equation 
([55]) expresses the fact that the first step takes the 
random walker to a neighbor I ~ i and then he has 
to reach j from there. 

It is important to mention that the cosine of an 
angle calculated in accordance to ([30)) has the struc- 
ture of Pearson's coefficient of linear correlations that 
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reveals it's natural statistical interpretation. The no- 
tion of angle between any two nodes of the graph 
arises naturally as soon as we become interested in 
the strength and direction of a linear relationship 
between two random variables, the flows of random 
walks moving through them. If the cosine of an angle 
(|30p is 1 (zero angles) , there is an increasing linear re- 
lationship between the flows of random walks through 
both nodes. Otherwise, if it is close to -1, there is a 
decreasing linear relationship. The correlation is 
if the variables are linearly independent. It is impor- 
tant to mention that as usual the correlation between 
nodes does not necessary imply a direct causal rela- 
tionship (an immediate connection) between them. 

6 Petersen graph and Venetian 
Canals 

In the present section, we construct and investigate 
the Euclidean embedding of two graphs. The first 
one we study is the Petersen graph with 10 nodes 
(see Fig. [5]). Another example is the spatial net- 
work of 96 Venetian canals which serve the function 
of roads in the ancient city that stretches across 122 
small islands (see Fig. [5]). While identifying canals 
over the plurality of water routes on the city map 
of Venice, the canal-named approach has been used, 
in which two different arcs of the city canal network 
were assigned to the same identification number pro- 
vided they have the same name. The Petersen graph 
is a regular graph, fcj = 3, i = 1, . . . 10, so that 
J2i ki = 30, and the stationary distribution of ran- 
dom walks is uniform, 7T t - ^ =0.1. The spectrum 
of the random walk transition operator (fTS)) defined 
on the Petersen graph contains the Perron eigenvalue 
/ii = 1 which is simple, then the eigenvalue = 1/3 
with multiplicity 5, and 113 = —2/3 with multiplicity 
4. Therefore, there are just 3 linearly independent 
eigenvectors, and two eigensubspaces for which the 
orthonormal basis vectors can be calculated, so that 
the resulting matrix of eigenvectors and basis vectors 
which we use in (|3 lti33[) always has full column dimen- 
sion. Random walks embed the Petersen graph into 
a 9-dimensional Euclidean space, in which all nodes 
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Figure 5: The Petersen graph. 

have equal norm (|3"Tj) . ||i|| T = 3.14642654 meaning 
that the expected number of steps a random walker 
starting from a node chosen randomly with probabil- 
ity p — 0.1 reaches any node in the Petersen graph 
equals 9.9. Indeed, the structure of 9-dimensional 
vector space induced by random walks defined on the 
Petersen graph cannot be represented visually, how- 
ever if we choose one node as a point of reference, we 
can draw its 2-dimensional projection by arranging 
other nodes at the distances calculated accordingly 
to (|33|) and under the angles (|30| they are with re- 
spect to the chosen reference node (see Fig. [7]). 

It is expected that a random walker starting at the 
node 1 visits any peripheral node (2, 3, 4, 5) and then 
returns back in 18 random steps, while 24 random 
steps are expected in order to visit any internal node 
in the deep of the graph (6, 7, 8, 9, 10). It is also ob- 
vious that while the linear relationship between the 
random walks flows through the node 1 and those 
through any peripheral node is positive, it is negative 
with respect to the flows passing through the internal 
nodes. Due to the symmetry of the Petersen graph, 
the figure displayed on Fig. [7] is essentially the same if 
we draw it with respect to any other periphery node 
(2, 3, 4, 5). It is also important to note that it turns to 
be mirror-reflected if we draw it with respect to any 
internal node (6,7,8,9,10). Therefore, we can con- 
clude that the Petersen graph contains two groups of 
nodes, at the periphery and in the depth, which ap- 
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Figure 6: The dual graph representation of the spatial 
network of 96 Venetian canals 



pears to be as much as a quarter more isolated from 
one another than the nodes within each group (18 
random steps vs. 24 random steps). It is clear that 
the 9— dimensional embedding of the Petersen graph 
into Euclidean space is characterized by the highest 
degree of symmetry. 

The dual graph of the spatial network of Venetian 
canals (FigO is much more complicated than the Pe- 
tersen graph. We construct it by mapping every canal 
into an individual node of the dual graph and con- 
necting the pairs of nodes by arcs when these canals 
intersect. The resulting graph is far from being reg- 
ular, so that the stationary distribution of random 
walks denned on it is not uniform. In [21j . we have 
discussed that it is not evident if the degree distri- 
butions observed in compact urban patterns and in 
the Venetian canal network, in particular, follow a 
power law. The spectrum of the symmetric Markov 
transition operator (JT5J) defined on that is presented 
in Fig. [5] The transition matrix T for the canal 
network in Venice is strongly defective. In partic- 
ular, it contains the eigenvalue fi = with multi- 
plicity 22. This degeneracy indicates the presence 
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Figure 7: The Euclidean space embedding of the Pe- 
tersen graph drawn with respect to the node #1. 




Figure 8: The spectrum of the Markov transition opera- 
tor (|19[) defined on the spatial network of Venetian canals. 



of a complete bipartite subgraph in the spatial net- 
work of Venice shown in Fig. [SJ In urban spatial 
networks encoded by their dual graphs, the values of 
access time (|31j) vary strongly from one canal to an- 
other being very long for topologically isolated places. 
Three data points characterized by the shortest ac- 
cess times shown in Fig. |H] are associated to the La- 
goon of Venice, the Giudecca canal, and the Grand 
canal - the most central water routes in the city canal 
network. Four data points characterized by the worst 
accessibility represent the canal subnetwork of Vene- 
tian Ghetto. 

The values of access times observed in the dual 
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\ 

Figure 9: The scatter plot (in the log- log scale) of the 
connectivity vs. the value of access time to a node in the 
dual graph representation of 96 Venetian canals. Three 
data points characterized by the shortest access times rep- 
resent the main water routes of Venice: the Lagoon of 
Venice, the Giudecca canal, and the Grand canal. Four 
data points of the worst accessibility are for the canal sub- 
network of Venetian Ghetto. The slope of the regression 
line equals 2.07. 

graph of Venetian canals scale with the connectivity 
of canals: the slope of the regression line equals 2.07. 

A similar power law can be observed for the isola- 
tion patterns in other urban environments. For ex- 
ample, in Fig. [TQl we have sketched the scatter plot 
of the connectivity vs. the norm a node in the dual 
graph representation of 50 streets in the downtown of 
Bielefeld. Professor Dr. Ludwig Streit works at the 
University of Bielefeld since 1972. 

In the space syntax analysis, the mean shortest dis- 
tance from a given node i to any other node in the 
graph, 

is used for characterizing the level of integration of 
the node in the city pattern, [22]. The relation be- 
tween the mean shortest distance to a node ii and the 
value of access time to the same node \\6i\\^ is very 
complicated and strongly depends on the topology of 
the graph. In Fig. [IT] we have shown the scatter plot 



.2e3\ 
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Figure 10: The scatter plot of the connectivity vs. the 
norm a node in the dual graph representation of 50 streets 
in the downtown of Bielefeld. The slope of the regression 
line equals 2.17. 



(in the log-log scale) of the mean shortest distance vs. 
the value of access time for the network of Venetian 
canals. The plot Fig. [TT1 indicates a positive relation 
between ii and ||#i|| T with the slope of the regression 
line equals 0.18. 

The 2-dimensional projection of the Euclidean 
space of 96 Venetian canals set up by random walks 
drawn for the the Grand Canal of Venice (the point 
(0,0)) is shown in Fig. [T2J Nodes of the dual graph 
representation of the canal network in Venice are 
shown by disks with radiuses taken equal to the de- 
grees of the nodes. All distances between the chosen 
origin and other nodes of the graph (Fig. [Tj]) have 
been calculated in accordance to ([55]) and ([50]) has 
been used in order to compute angles between nodes. 
Canals negatively correlated with the Grand Canal of 
Venice are set under negative angles (below the hori- 
zontal), and under positive angles (above the horizon- 
tal) if otherwise. It is evident from Fig. [T2]that disks 
of smaller radiuses demonstrate a clear tendency to 
be located far away from the origin being charac- 
terized by excessively long commute times with the 
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Figure 11: The scatter plot (in the log- log scale) of the 
mean shortest distance vs. the value of access time for the 
network of Venetian canals. The plot indicates a positive 
relation with the slope of regression line equals 0.18. 

reference node (the Grand canal of Venice), while the 
large disks which stand in Fig. [12] for the main wa- 
ter routes are settled in the closest proximity to the 
origin that intends an immediate access to them. 

7 Discussion and Conclusion 

From Euler's time, urban design and townscape 
studies were the sources of inspiration for the net- 
work analysis and graph theory. It is common now 
that networks are the reality of urban renewals [32] . 
Flows of pedestrians and vehicles through a city are 
dependent on one another and that requests for or- 
ganizing them in a network setting. Moreover, the 
networking is structurally contagious. In order to 
be able to master a network effectively, an authority 
should also constitute a network structure, probably 
as complicated as the one that it supervises. A com- 
plex network of city itineraries that we can experi- 
ence in everyday life appears as the result of multiple 
complex interactions between various transport, so- 
cial, and economical networks. 

In the present paper, we have developed a self- 
consistent approach for exploring the structure of ur- 



Figure 12: The 2-dimensional projection of the 95- 
dimensional Euclidean spaces associated to random walks 
defined on the city canal network built from the perspec- 
tive of the Grand canal of Venice chosen as the origin. 
The labels of the horizontal axes display the expected 
number of random walk steps. The labels of the vertical 
axes show the degree of nodes (radiuses of the disks). 



ban environments based on the use of Markov chains 
(random walks) defined on the dual graph of the city. 
The stochastic transition matrix of random walks 
gives a matrix representation for the group of au- 
tomorphisms of the graph G(V,E). The expected 
number of steps required to reach a node of the graph 
starting from a node chosen from the stationary dis- 
tribution over V given by (fBTj) can be considered as 
a characteristic time scale 7jj] = ||<5i|| T quantifying 
the accessibility of the node i in the graph G by 
random walks. The data of scatter plots displayed 
in Figs. 191 1 01 show that in the studied urban pat- 
terns 7[j] scales with the degree of vertex, so that 
T(k) cx k- 2 - 2E , with some e > (e = 0.085 for 
the downtown of Bielefeld and e = 0.035 for the 
canal network of Venice). We have also discussed 
in Sec. 12.11 that in the free flow regime of a queuing 
network preserving the proportion of time Aj spent 
in i £ V constant (which then can be considered as 
the normalized length distance of the given street i), 
the expected first passage time to the node from a 
node random chosen among all nodes of the queuing 
network equals A" 1 . While equalizing these passage 
times, we obtain that in the free flow regime the pro- 
portion of time A spent by a moving agent in an open 
place of the given transport network scales with its 
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connectivity as 

A(fc) cx k 2+2e , (37) 

where the value e > is determined by the entire 
topology of the city. 

It is well documented that isolation worsens an 
area's economic prospects by reducing opportunities 
for commerce, and engenders a feeling of isolation 
for the inhabitants, both of which can fuel poverty 
and crime [3]. The method we have introduced in 
the present paper could easily be used to identify 
isolated neighborhoods in big cities with a complex 
web of roads, walkways and public transport systems 
spotting hidden areas of geographical isolation in the 
urban landscape. The method can be used to analyze 
complex transport networks of any type. 
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